Towards realistic codon models: among site variability and dependency of synonymous and non-synonymous rates
Authors
Abstract
Codon evolutionary models are widely used to infer the selection forces acting on a protein. The non-synonymous to synonymous rate ratio (denoted by Ka/Ks) is used to infer specific positions that are under purifying or positive selection. Current evolutionary models usually assume that only the non-synonymous rates vary among sites while the synonymous substitution rates are constant. This assumption ignores the possibility of selection forces acting at the DNA or mRNA levels. Towards a more realistic description of sequence evolution, we present a model that accounts for among-site variation of both synonymous and non-synonymous substitution rates. Furthermore, we relax the widespread assumption that positions evolve independently of each other. Thus, possible sources of bias caused by random fluctuations in either the synonymous or non-synonymous rate estimates at a single site are removed. Our model is based on two hidden Markov models that operate on the spatial dimension: one describes the dependency between adjacent non-synonymous rates while the other describes the dependency between adjacent synonymous rates. The presented model is applied to study the selection pressure across the HIV-1 genome. The new model better describes the evolution of all HIV-1 genes, as compared to current codon models. Using both simulations and real data analyses, we illustrate that accounting for synonymous rate variability and dependency greatly increases the accuracy of Ka/Ks estimation and, in particular, of the inference of positively selected sites. Finally, we discuss the applicability of the developed model to infer the selection forces in regulatory and overlapping regions of the HIV-1 genome.
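The idea of two spatial hidden Markov models, one over non-synonymous rate categories and one over synonymous rate categories, can be illustrated with a small sketch. This is not the authors' implementation: the rate values, the "stay" probabilities, the uniform initial distribution, and the per-site likelihoods below are all hypothetical, and in a real analysis the per-site likelihoods would come from a codon model evaluated on a phylogeny. The sketch combines the two chains into one joint chain (a Kronecker product of their transition matrices) and runs forward-backward to obtain a posterior-mean Ka/Ks per site.

```python
from itertools import product


def autocorrelated_transitions(k, stay):
    """k x k matrix: keep the same rate category with probability `stay`,
    otherwise move uniformly to one of the other categories."""
    move = (1.0 - stay) / (k - 1)
    return [[stay if i == j else move for j in range(k)] for i in range(k)]


def kronecker(a, b):
    """Joint transition matrix of two independent chains run side by side.
    Joint states are ordered (a-state, b-state) with the b-state varying fastest."""
    return [[a[i][k] * b[j][l] for k in range(len(a)) for l in range(len(b))]
            for i in range(len(a)) for j in range(len(b))]


def forward_backward(emissions, trans, init):
    """Posterior probability of each hidden state at each site (scaled passes)."""
    n, s = len(emissions), len(init)
    fwd, prev = [], None
    for t in range(n):
        if t == 0:
            row = [init[j] * emissions[0][j] for j in range(s)]
        else:
            row = [emissions[t][j] * sum(prev[i] * trans[i][j] for i in range(s))
                   for j in range(s)]
        z = sum(row)
        prev = [x / z for x in row]
        fwd.append(prev)
    bwd = [[1.0] * s for _ in range(n)]
    for t in range(n - 2, -1, -1):
        row = [sum(trans[i][j] * emissions[t + 1][j] * bwd[t + 1][j] for j in range(s))
               for i in range(s)]
        z = sum(row)
        bwd[t] = [x / z for x in row]
    post = []
    for t in range(n):
        row = [fwd[t][j] * bwd[t][j] for j in range(s)]
        z = sum(row)
        post.append([x / z for x in row])
    return post


# Hypothetical discrete rate categories (assumptions, not values from the paper).
ka_rates = [0.2, 2.0]
ks_rates = [0.5, 1.5]
states = list(product(range(len(ka_rates)), range(len(ks_rates))))  # (ka idx, ks idx)

# One chain per rate type; adjacent sites tend to share a category.
trans = kronecker(autocorrelated_transitions(2, 0.9),
                  autocorrelated_transitions(2, 0.8))
init = [1.0 / len(states)] * len(states)

# Hypothetical P(alignment column | ka category, ks category) for four sites,
# in the same order as `states`.
emissions = [
    [0.40, 0.10, 0.30, 0.20],
    [0.35, 0.15, 0.30, 0.20],
    [0.05, 0.05, 0.60, 0.30],
    [0.10, 0.05, 0.55, 0.30],
]

posterior = forward_backward(emissions, trans, init)
ka_ks = [sum(p * ka_rates[i] / ks_rates[j] for p, (i, j) in zip(row, states))
         for row in posterior]
```

Setting both "stay" probabilities to 0.5 makes every transition equally likely, so each site's posterior depends only on its own likelihoods; this corresponds to the independent-sites assumption that the abstract describes relaxing.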
Similar resources
Molecular evolution of the ent-kaurenoic acid oxidase gene in Oryzeae
We surveyed the substitution patterns in the ent-kaurenoic acid oxidase (KAO) gene in 11 species of Oryzeae with an outgroup in the Ehrhartoideae. The synonymous and non-synonymous substitution rates showed a high positive correlation with each other, but were negatively correlated with codon usage bias and GC content at third codon positions. The substitution rate was heterogeneous among lineag...
Mutational Pressure Drives Evolution of Synonymous Codon Usage in Genetically Distinct Oenothera plastomes
Background: Most amino acids are encoded by more than one codon; these are termed synonymous codons. Synonymous codon usage is not random and is specific to each species. In each amino acid family, some synonymous codons are preferred, and this is referred to as synonymous codon usage bias (SCUB). Trends associated with evolution of SCUB and factors influencing its diversification in plastomes of gen...
Identification of a Rare Synonymous Beta Globin Mutation, HBB:c.180G>A codon 59 (G>A) in an Iranian Patient
Beta thalassemia is the most common autosomal recessive disorder. The present study reports a rare β globin gene mutation, HBB: c.180G>A: codon 59 (AAG/AAA), in a patient from Gilan province, northern Iran. Nucleotide sequencing of amplified DNA from a 35-year-old man presenting with mild hypochromia revealed a synonymous mutation due to a G>A conversion at the third position of codon 59 o...
The rate of synonymous substitution in enterobacterial genes is inversely related to codon usage bias.
Gene sequences from Escherichia coli, Salmonella typhimurium, and other members of the Enterobacteriaceae show a negative correlation between the degree of synonymous-codon usage bias and the rate of nucleotide substitution at synonymous sites. In particular, very highly expressed genes have very biased codon usage and accumulate synonymous substitutions very slowly. In contrast, there is litt...
Identification of Synonymous Codon Usage Bias in the Pseudorabies Virus UL31 Gene
Background: Little is known about the synonymous codon usage pattern of the pseudorabies virus (PRV) genome, especially of the UL31 gene, over the course of its evolution. Objectives: In the present study, the codon usage bias between the PRV UL31 sequence and the UL31-like sequences was identified. Materials and Methods: We used a comprehensive analysi...
Journal:
- Bioinformatics
Volume 23, Issue 13
Pages: -
Publication date: 2007